In The Claims: 



1 . (Currently Amended) A system for optimizing speech recognition 
procedures, comprising: 

initial language models each iteratively created by combining source 
models according to interpolation coefficients that define 
proportional relationships for combining said source models; 

a speech recognizer that utilizes said initial language models to iteratively 
process input development data in corresponding ones of said speech 
recognition procedures for calculating word-error rates that each 
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correspond to a different one of said initial language models; and 
an optimized language model selected from said initial language models by 

identifying an optimal word-error rate from among said word-error rates, 
said speech recognizer utilizing said optimized language model for 
performing subsequent ones of said speech recognition procedures. 

2. (Original) The system of claim 1 wherein said word-error rates are calculated 
by comparing a correct transcription of said input development data and a top 
recognition candidate from an N-best list that is rescored by a rescoring module for 
each of said initial language models. 

3. (Original) The system of claim 1 wherein said initial language models are 
implemented as statistical language models that include N-grams and probability 
values that each correspond to one of said N-grams. 

4. (Currently Amended) The system of claim 1 wherein said input development 
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data includes a pre-defined series of spoken word sequences from which said 
recognizer rescores a corresponding N-best list for calculating said word-error rates. 
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* * 



5. (Currently Amended) The system of claim 1 wherein said source models are 
each similarly implemented as statistical language models that include N-grams and 
probability values that each correspond to one of said N-grams , each of said N-grams 
being the same in all of said source models . 

6. (Original) The system of claim 1 wherein each of said source models 
corresponds to a different application domain that is related to a particular speech 
environment. 

7. (Original) The system of claim 1 wherein sets of said interpolation coefficients 
are each associated with a different one of said source models to define how much 
said different one of said source models contributes to a corresponding one of said 
initial language models. 

8. (Original) The system of claim 1 wherein said interpolation coefficients are 
each multiplied with a different one of said source models to produce a series of 
weighted source models that are then combined to produce a corresponding one of 
said initial language models. 

9. (Original) The system of claim 1 wherein said initial language models are each 
calculated by a formula: 

LM = Ai SMi + A 2 SM 2 + . . . + A n SM n 

where said LM is one of said initial language models, said SMi is a first one of said 
source models, said SM n is a final one of said source models in a continuous 
sequence of "n" source models, and said Ai, said A2, and said A n are said 
interpolation coefficients applied to respective probability values of said source 
models to weight how much each of said source models contributes to said one of 
said initial language models. 
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10. (Original) The system of claim 1 wherein said interpolation coefficients are 
each greater than or equal to "0", and are also each less than or equal to "1", a sum 
of all of said interpolation coefficients being equal to "1". 

1 1 . (Original) The system of claim 1 wherein said interpolation coefficients for 
creating said optimized language model are selectively chosen by analyzing effects of 
various combinations of said interpolation coefficients upon said word-error rates 
that correspond to recognition accuracy characteristics of said speech recognizer, 
said optimized language model being directly implemented by minimizing said 
optimal word-error rate through a selection of said interpolation coefficients. 

12. (Original) The system of claim 1 wherein a rescoring module repeatedly 
processes said input development data to rescore an N-best list of recognition 
candidates for calculating said word-error rates by comparing a top recognition 
candidate to said input development data, said recognition candidates each 
including a recognition result in a text format, and a corresponding recognition 
score. 

13. (Original) The system of claim 1 wherein each of said word-error rates are 
calculated by comparing a correct transcription of said input development data and 
a top recognition candidate from an N-best list of recognition candidates provided by 
said speech recognizer after processing said input development data, said top 
recognition candidate corresponding to a best recognition score from said speech 
recognizer. 
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14. (Original) The system of claim 1 wherein said word-error rates are calculated 
to include one or more substitutions in which a first incorrect word has been 
substituted for a first correct word in a recognition result, said word-error rates also 
including one or more deletions in which a second correct word has been deleted 
from said recognition result, said word-error rates further including one or more 
insertions in which a second incorrect word has been inserted into said recognition 
result. 

15. (Original) The system of claim 1 wherein said word-error rates are each 
calculated according to a formula: 

WER = (Subs + Deletes + Inserts) /Total Words in Correct Transcription 

where said WER is one of said word-error rates corresponding to one of said initial 
language models, said Subs are substitutions in a recognition result, said Deletes 
are deletions in said recognition result, said Inserts are insertions in said recognition 
result, and said Total Words in Correct Transcription is a total number of words in a 
correct transcription of said input development data. 

16. (Original) The system of claim 1 wherein an interpolation procedure for 
combining said source models into one of said initial language models is 
performed by utilizing a selected initial set of said interpolation coefficients. 

* 

17. (Original) The system of claim 16 wherein a rescoring module rescores an N- 
best list of recognition candidates after utilizing said one of said initial language 
models to perform a recognition procedure upon said input development data. 

18. (Original) The system of claim 17 wherein one of said word-error rates 
corresponding to said one of said initial language models is calculated and stored 
based upon a comparison between a correct transcription of said input development 
data and a top recognition candidate from said N-best list. 
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19. (Original) The system of claim 18 wherein said selected initial set of said 
interpolation coefficients are each iteratively altered by a pre-defined amount to 
produce subsequent sets of said interpolation coefficients. 

20. (Original) The system of claim 19 wherein subsequent initial language models 
are created by utilizing said subsequent sets of interpolation coefficients, a rescoring 
module iteratively utilizing said subsequent initial language models to rescore said 
N-best list for calculating subsequent word-error rates, said optimized language 
model being selected by identifying said optimal word-error rate when a pre- 
determined number of said subsequent word-error rates have been calculated. 

2 1 . (Currently Amended) A method for optimizing speech recognition 
procedures, comprising: 

creating initial language models by iteratively combining source 
models according to interpolation coefficients that define 
proportional relationships for combining said source models; 

utilizing said initial language models to iteratively process input development 
data in corresponding ones of said speech recognition procedures for 
calculating word-error rates that each correspond to a different one of 
said initial language models; 

selecting an optimized language model from said initial language models by 

identifying an optimal word-error rate from among said word-error rates; 
and 

utilizing said optimized language model for performing subsequent ones of 
said speech recognition procedures with a speech recognizer. 

22. (Original) The method of claim 21 wherein said word-error rates are calculated 
by comparing a correct transcription of said input development data and a top 
recognition candidate from an N-best list that is rescored by a rescoring module for 
each of said initial language models. 
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23. (Original) The method of claim 21 wherein said initial language models are 
implemented as statistical language models that include N-grams and probability 
values that each correspond to one of said N-grams. 

24. (Currently Amended) The method of claim 21 wherein said input development 
data includes a pre-defined series of spoken word sequences from which said 
recognizer rescores a corresponding N-best list for calculating said word-error rates. 

25. (Currently Amended) The method of claim 21 wherein said source models are 
each similarly implemented as statistical language models that include N-grams and 
probability values that each correspond to one of said N-grams , each of said N-grams 
being the same in all of said source models . 

26. (Original) The method of claim 21 wherein each of said source models 
corresponds to a different application domain that is related to a particular speech 
environment. 

27. (Original) The method of claim 21 wherein sets of said interpolation 
coefficients are each associated with a different one of said source models to define 
how much said different one of said source models contributes to a corresponding 
one of said initial language models. 

28. (Original) The method of claim 21 wherein said interpolation coefficients are 
each multiplied with a different one of said source models to produce a series of 
weighted source models that are then combined to produce a corresponding one of 
said initial language models. 
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29. (Original) The method of claim 2 1 wherein said initial language models are 
each calculated by a formula: 



LM = Ai SMi + A 2 SM 2 + . . . + A n SM n 

where said LM is one of said initial language models, said SMi is a first one of said 
source models, said SM n is a final one of said source models in a continuous 
sequence of "n" source models, and said Ai, said A2, and said A n are said 
interpolation coefficients applied to respective probability values of said source 
models to weight how much each of said source models contributes to said one of 
said initial language models. 

30. (Original) The method of claim 21 wherein said interpolation coefficients are 
each greater than or equal to "0", and are also each less than or equal to "1", a sum 
of all of said interpolation coefficients being equal to "I". 

3 1 . (Original) The method of claim 2 1 wherein said interpolation coefficients for 
creating said optimized language model are selectively chosen by analyzing effects of 
various combinations of said interpolation coefficients upon said word-error rates 
that correspond to recognition accuracy characteristics of said speech recognizer, 
said optimized language model being directly implemented by minimizing said 
optimal word-error rate through a selection of said interpolation coefficients. 
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32. (Original) The method of claim 21 wherein a rescoring module repeatedly 
processes said input development data to generate and rescore an N-best list of 
recognition candidates for calculating said word-error rates by comparing a top 
recognition candidate to said input development data, said recognition candidates 
each including a recognition result in a text format, and a corresponding recognition 
score. 
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33. (Original) The method of claim 2 1 wherein each of said word-error rates are 
calculated by comparing a correct transcription of said input development data and 
a top recognition candidate from an N-best list of recognition candidates provided by- 
said speech recognizer after processing said input development data, said top 
recognition candidate corresponding to a best recognition score from said speech 
recognizer. 

34. (Original) The method of claim 21 wherein said word-error rates are calculated 
to include one or more substitutions in which a first incorrect word has been 
substituted for a first correct word in a recognition result, said word-error rates also 
including one or more deletions in which a second correct word has been deleted 
from said recognition result, said word-error rates further including one or more 
insertions in which a second incorrect word has been inserted into said recognition 
result. 

35. (Original) The method of claim 21 wherein said word-error rates are each 
calculated according to a formula: 

WER = (Subs + Deletes + Inserts)/Total Words in Correct Transcription 

where said WER is one of said word-error rates corresponding to one of said initial 
language models, said Subs are substitutions in a recognition result, said Deletes 
are deletions in said recognition result, said Inserts are insertions in said recognition 
result, and said Total Words in Correct Transcription is a total number of words in a 
correct transcription of said input development data. 

36. (Original) The method of claim 21 wherein an interpolation procedure for 
combining said source models into one of said initial language models is 
performed by utilizing a selected initial set of said interpolation coefficients. 
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37. (Original) The method of claim 36 wherein a rescoring module rescores an N- 
best list of recognition candidates after utilizing said one of said initial language 
models to perform a recognition procedure upon said input development data. 

38. (Original) The method of claim 37 wherein one of said word-error rates 
corresponding to said one of said initial language models is calculated and stored 
based upon a comparison between a correct transcription of said input development 
data and a top recognition candidate from said N-best list. 

39. (Original) The method of claim 38 wherein said selected initial set of said 
interpolation coefficients are each iteratively altered by a pre-defined amount to 
produce subsequent sets of said interpolation coefficients. 

40. (Original) The method of claim 39 wherein subsequent initial language models 
are created by utilizing said subsequent sets of interpolation coefficients, a rescoring 
module iteratively utilizing said subsequent initial language models to rescore said 
N-best list for calculating subsequent word-error rates, said optimized language 
model being selected by identifying said optimal word-error rate when a pre- 
determined number of said subsequent word-error rates have been calculated. 
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4 1 . (Currently Amended) A system for optimizing speech recognition 
procedures, comprising: 

means for creating initial language models by iterativelv combining 

source models according to interpolation coefficients that define 
proportional relationships for combining said source models; 

means for utilizing said initial language models to iterativelv process input 
development data in corresponding ones of said speech recognition 
procedures for calculating word-error rates that each correspond to a 
different one of said initial language models; 

means for selecting an optimized language model from said initial language 

models by identifying an optimal word-error rate from among said word- 
error rates; and 

means for utilizing said optimized language model for performing subsequent 
ones of said speech recognition procedures. 



11 



42. (Currently Amended) A system for optimizing speech recognition 
procedures, comprising: 

initial language models each iterativelv created by combining source 
models according to interpolation coefficients that define 
proportional relationships for combining said source models; 

a speech recognizer that utilizes said initial language models to iterativelv 
process input development data in corresponding ones of said speech 
recognition procedures for calculating word-error rates that each 
correspond to a different one of said initial language models, said word- 
error rates being calculated by comparing a correct transcription of said 
input development data and a top recognition candidate from an N-best 
list that is rescored by a rescoring module for each of said initial 
language models; and 

an optimized language model selected from said initial language models by 

identifying an optimal word-error rate from among said word-error rates, 
said speech recognizer utilizing said optimized language model for 
performing subsequent ones of said speech recognition procedures. 

43. (New) The method of claim 26 wherein said different application domain 
alternately includes any of a news domain, an Internet domain, a financial 
information domain, and a spontaneous speech domain. 

44. (New) The method of claim 21 wherein said source models may include any 
number of different individual language models. 

45. (New) The method of claim 21 wherein said source models are implemented as 
finalized language models that are individually capable of being separately utilized 
for performing said speech recognition procedures before being combined to produce 
said initial language models. 
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